Abstract

In this note, we introduce a class of indicators that make it possible to compute efficiently
optimal transport plans associated with arbitrary distributions of $N$ demands and $N$ supplies
in $\mathbb{R}$ in the case where the cost function is concave. The computational cost of these
indicators is small and independent of $N$. A hierarchical use of them yields an efficient
algorithm.
I. INTRODUCTION

It is well known that transport problems on the line involving convex cost functions
have explicit solutions, consisting in a monotone rearrangement. Recently, an efficient
method has been introduced to tackle this issue on the circle [4]. In this note we
introduce an algorithm that makes it possible to tackle optimal transport problems on the
line (but actually also on the circle) with concave costs. Our algorithm complements the
method suggested by McCann [2]. McCann considers general real values of supply and demand
and shows how the problem can be reduced to convex optimization somewhat similar to the
simplex method in linear programming. Our approach as presented here is developed for the
case of unit masses and is closer to the purely combinatorial approach of [1], but extends
it to a general concave cost function. The extension to integer masses will be presented in
a forthcoming work.
The method we propose is based on a class of local indicators that make it possible to
detect consecutive points that are matched in an optimal transport plan. Thanks to the
low number of evaluations of the cost function required to apply the indicators, we
derive an algorithm that finds an optimal transport plan in $n^2$ operations in the
worst case. In practice, the computational cost of this method appears to behave
linearly with respect to $n$.

Since the indicators apply locally, the algorithm can be massively parallelized and
also makes it possible to treat optimal transport problems on the circle. In this way, it
extends the work of Aggarwal et al. [1], in which cost functions depend linearly on
the distance.

II. SETTING OF THE PROBLEM 

For $N_0 \in \mathbb{N}^*$, consider $P = (p_i)_{i=1,\dots,N_0}$ and $Q = (q_i)_{i=1,\dots,N_0}$,
two sets of points in $\mathbb{R}$ that represent respectively demand and supply locations.
The problem we consider in this note consists in minimizing the transport cost

$$C(\sigma) = \sum_{i=1}^{N_0} c(p_i, q_{\sigma(i)}), \tag{1}$$

where $\sigma$ is a permutation of $\{1, \dots, N_0\}$. This permutation forms a transport plan.
We focus on the case where the function $c$ involves a concave function as stated in
the next definition. The cost function in (1) is defined by $c(p, q) = g(|p - q|)$
with $p, q \in \mathbb{R}$, where $g(\cdot)$ is a concave non-decreasing real-valued function of
a real positive variable such that $g(0) := \lim_{x \to 0} g(x) \geq -\infty$. Some examples of
such costs are given by $g(x) = \log(x)$ with $g(0) = -\infty$, and $g(x) = \sqrt{x}$ or
$g(x) = |x|$ with $g(0) = 0$.

Finally, we denote by $\sigma^*$ the permutation associated with a given optimal transport
plan between $P$ and $Q$: for every permutation $\sigma$ of $\{1, \dots, N_0\}$, $C(\sigma^*) \leq C(\sigma)$.
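
As an illustration of the cost (1), the following minimal Python sketch evaluates $C(\sigma)$ for a given permutation and finds an optimal plan by exhaustive search. The names `transport_cost` and `brute_force_optimum`, the 0-based indexing and the sample points are assumptions made for this example only. The enumeration visits all $N_0!$ permutations, so it is meant as a reference for very small instances, not as a practical method.

```python
from itertools import permutations
from math import sqrt

def transport_cost(p, q, sigma, g):
    """C(sigma) = sum_i g(|p[i] - q[sigma[i]]|), with 0-based indices."""
    return sum(g(abs(p[i] - q[sigma[i]])) for i in range(len(p)))

def brute_force_optimum(p, q, g):
    """Return a permutation minimizing the cost by full enumeration."""
    return min(permutations(range(len(p))),
               key=lambda s: transport_cost(p, q, s, g))

# Hypothetical data: two demands p and two supplies q, with g(x) = sqrt(x).
p = [0.0, 1.01]
q = [1.0, 2.01]
sigma_star = brute_force_optimum(p, q, sqrt)
identity_cost = transport_cost(p, q, (0, 1), sqrt)
optimal_cost = transport_cost(p, q, sigma_star, sqrt)
```

With these points the close pair (1.0, 1.01) is matched together, which shows that a concave cost does not in general select the monotone rearrangement.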



III. CHAINS 



In this section, we present a way to build a particular partition of the set $P \cup Q$.
Consider two pairs of matched points $(p_i, q_{\sigma^*(i)})$ and $(p_{i'}, q_{\sigma^*(i')})$,
say e.g. $p_i < q_{\sigma^*(i)}$, $p_{i'} < q_{\sigma^*(i')}$. It is easy to prove that the
following alternative holds:

1- $[p_i, q_{\sigma^*(i)}] \cap [p_{i'}, q_{\sigma^*(i')}] = \emptyset$,

2- $[p_i, q_{\sigma^*(i)}] \subset [p_{i'}, q_{\sigma^*(i')}]$ or $[p_{i'}, q_{\sigma^*(i')}] \subset [p_i, q_{\sigma^*(i)}]$.

This remark is a direct consequence of the concavity of the cost function and is
often referred to as "the non-crossing rule" [1, 2]. We now show how it allows one to
decompose the initial situation into sub-problems where supply and demand points
alternate.
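
The non-crossing rule can be checked numerically on random configurations. The sketch below, written under the assumption that $p_1 < p_2 < q_1 < q_2$ (two overlapping, non-nested intervals), compares the cost of the crossing matching with the cost of the nested one for a concave non-decreasing $g$. The function names and the tolerance are illustrative choices, not part of the original note.

```python
import random
from math import sqrt, log1p

def uncrossing_gain(p1, p2, q1, q2, g):
    """For p1 < p2 < q1 < q2, return cost(crossing) - cost(nested)."""
    crossing = g(q1 - p1) + g(q2 - p2)   # [p1, q1] and [p2, q2] overlap
    nested = g(q2 - p1) + g(q1 - p2)     # [p2, q1] lies inside [p1, q2]
    return crossing - nested

def check_non_crossing(g, trials=1000, seed=0):
    """True if the nested matching is never worse on random samples."""
    rng = random.Random(seed)
    for _ in range(trials):
        p1, p2, q1, q2 = sorted(rng.random() for _ in range(4))
        if uncrossing_gain(p1, p2, q1, q2, g) < -1e-12:
            return False
    return True

ok_sqrt = check_non_crossing(sqrt)
ok_log = check_non_crossing(log1p)       # log(1 + x): concave, non-decreasing
```

The inequality behind this check is the concavity estimate $g(a) + g(b) \leq g(a') + g(b')$ whenever $a + b = a' + b'$ and $a'$, $b'$ both lie between $a$ and $b$.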

Because of the non-crossing rule, in an optimal plan there are as many supply points
as demand points between any pair of matched points $p_i$ and $q_{\sigma^*(i)}$. For a given
demand point $p_i$, define its left neighbor $q'_i$ as the nearest supply point on the left
of $p_i$ such that the numbers of supply and demand points between $q'_i$ and $p_i$ are
equal; define the right neighbor $q''_i$ of $p_i$ in a similar way. Then define a chain as a
maximal alternating sequence of supply and demand points $(p_{i_1}, q_{j_1}, \dots, p_{i_k}, q_{j_k})$ or
$(q_{j_1}, p_{i_2}, \dots, p_{i_{k+1}})$ such that each $q_{j_\ell}$ is the right neighbor of $p_{i_\ell}$ and
the left neighbor of $p_{i_{\ell+1}}$. An extension of the proof of Lemma 3 of [1] shows that the
collection of chains forms a partition of the set $P \cup Q$. A simple example of such a
partition is shown in Fig. 1. Note that the construction of this collection only depends on
the relative positions of supply and demand points and does not involve any evaluation of
the cost function. It can be done in $O(N)$ operations.
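
One possible way to build the chains in a single pass is sketched below. It assumes that the points are already sorted by position and given as `(position, kind)` pairs, with `kind` equal to `'p'` for a demand and `'q'` for a supply. A running height is increased at each demand and decreased at each supply; a demand is assigned the height seen just before it, a supply the height reached just after it, and points sharing the same level are grouped together. Two points of the same level have equal numbers of supplies and demands between them, which is the neighbor relation used above. The function name and the level-based formulation are assumptions of this sketch.

```python
from collections import defaultdict

def build_chains(points):
    """Group sorted (position, kind) points into chains by level.

    kind is 'p' (demand) or 'q' (supply). Runs in one pass over the points.
    """
    height = 0
    chains = defaultdict(list)
    for pos, kind in points:
        if kind == 'p':
            level = height      # level = height before stepping up
            height += 1
        else:
            height -= 1         # level = height after stepping down
            level = height
        chains[level].append((pos, kind))
    return list(chains.values())

# Hypothetical configuration: p p q q p q
points = [(0, 'p'), (1, 'p'), (2, 'q'), (3, 'q'), (4, 'p'), (5, 'q')]
chains = build_chains(points)
```

In this example the inner pair at positions 1 and 2 forms its own chain, while the four remaining points form a second, alternating chain.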



FIG. 1: Example of chains. 



The non-crossing rule implies that the two points of any matched pair in an optimal
transport plan must belong to the same chain. We therefore restrict ourselves in the
sequel, without loss of generality, to the case of a single chain

$$p_1 < q_1 < \dots < p_i < q_i < p_{i+1} < q_{i+1} < \dots < p_N < q_N, \tag{2}$$

for $N \in \mathbb{N}^*$, and keep this notation throughout the rest of this paper.

IV. LOCAL MATCHING INDICATORS 

Thanks to the non-crossing rule, one knows that there exist at least two consecutive
points $(p_i, q_i)$ or $(q_i, p_{i+1})$ that are matched in any optimal transport plan.
Starting from this remark, we take advantage of the structure of a chain to introduce
a class of indicators that make it possible to detect a priori such pairs of points. We
define

$$I_k^p(i) = c(p_i, q_{i+k}) + \sum_{\ell=0}^{k-1} c(p_{i+\ell+1}, q_{i+\ell}) - \sum_{\ell=0}^{k} c(p_{i+\ell}, q_{i+\ell}),$$

where $k$, $i$ are such that $1 \leq k \leq N-1$ and $1 \leq i \leq N-k$, and

$$I_k^q(i) = c(p_{i+k+1}, q_i) + \sum_{\ell=1}^{k} c(p_{i+\ell}, q_{i+\ell}) - \sum_{\ell=0}^{k} c(p_{i+\ell+1}, q_{i+\ell}),$$

for $k, i \in \mathbb{N}$ such that $1 \leq k \leq N-2$ and $1 \leq i \leq N-k-1$. The interest of
these functions lies in the next result.

Theorem 1 Let $k_0 \in \mathbb{N}$ with $1 \leq k_0 \leq N-1$ and $i_0 \in \mathbb{N}$ (resp. $i_0' \in \mathbb{N}$), such
that $1 \leq i_0 \leq N-k_0$ (resp. $1 \leq i_0' \leq N-k_0-1$).
Assume that

1- $I_k^p(i) \geq 0$ for $k = 1, \dots, k_0-1$, $1 \leq i \leq N-k$,

2- $I_k^q(i') \geq 0$ for $k = 1, \dots, k_0-1$, $1 \leq i' \leq N-k-1$,

3- $I_{k_0}^p(i_0) < 0$ (resp. $I_{k_0}^q(i_0') < 0$).

Then any permutation $\sigma^*$ associated with an optimal transport plan satisfies
$\sigma^*(i) = i-1$ for $i = i_0+1, \dots, i_0+k_0$ (resp. $\sigma^*(i) = i$ for $i = i_0'+1, \dots, i_0'+k_0$).

FIG. 2: Schematic representation of the result stated in Thm. 1 in the case where $k_0 = 1$.

In practice, these indicators make it possible to find pairs of neighbors that are matched
in an optimal transport plan. This result is illustrated in Fig. 2.
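
The two indicators can be evaluated directly from their definitions. The sketch below is a straightforward transcription with 0-based indices (so the 1-based index $i$ of the text becomes `i - 1`); the helper names `cost`, `indicator_p` and `indicator_q` and the sample chains are assumptions made for illustration. The sums are recomputed at every call, which is simple but not the cheapest way to evaluate them.

```python
from math import sqrt

def cost(a, b, g):
    """c(a, b) = g(|a - b|)."""
    return g(abs(a - b))

def indicator_p(p, q, k, i, g):
    """I_k^p(i) on the chain p[0] < q[0] < p[1] < ..., 0 <= i <= N-k-1."""
    nested = cost(p[i], q[i + k], g) + sum(
        cost(p[i + l + 1], q[i + l], g) for l in range(k))
    in_order = sum(cost(p[i + l], q[i + l], g) for l in range(k + 1))
    return nested - in_order

def indicator_q(p, q, k, i, g):
    """I_k^q(i) on the same chain, 0 <= i <= N-k-2."""
    nested = cost(p[i + k + 1], q[i], g) + sum(
        cost(p[i + l], q[i + l], g) for l in range(1, k + 1))
    in_order = sum(cost(p[i + l + 1], q[i + l], g) for l in range(k + 1))
    return nested - in_order

# Hypothetical chain with N = 2: q[0] and p[1] are very close.
i1p = indicator_p([0.0, 1.01], [1.0, 2.01], 1, 0, sqrt)

# Hypothetical chain with N = 3: p[1] and q[1] are very close.
i1q = indicator_q([-1.0, 1.0, 2.01], [0.0, 1.01, 3.0], 1, 0, sqrt)
```

Both values are negative here. With $k_0 = 1$ the first two assumptions of Theorem 1 are empty, so the theorem applies: in the first chain the second demand is matched with the first supply, and in the second chain the second demand is matched with the second supply.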
Before giving the proof, we state a basic result. 

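Lemma 2 We keep the previous notations. Define

$$\varphi^p_{k,i}(x, y) = c(p_i - x, q_{i+k} + y) + \sum_{\ell=0}^{k-1} c(p_{i+\ell+1}, q_{i+\ell}) - c(p_i - x, q_i) - c(p_{i+k}, q_{i+k} + y) - \sum_{\ell=1}^{k-1} c(p_{i+\ell}, q_{i+\ell}),$$

for $k, i \in \mathbb{N}$ such that $1 \leq k \leq N-1$ and $1 \leq i \leq N-k$, and

$$\varphi^q_{k,i}(x, y) = c(p_{i+k+1} + y, q_i - x) + \sum_{\ell=1}^{k} c(p_{i+\ell}, q_{i+\ell}) - c(p_{i+1}, q_i - x) - c(p_{i+k+1} + y, q_{i+k}) - \sum_{\ell=1}^{k-1} c(p_{i+\ell+1}, q_{i+\ell}),$$

for $k, i \in \mathbb{N}$ such that $1 \leq k \leq N-2$ and $1 \leq i \leq N-k-1$. Both functions
$\varphi^p_{k,i}(x, y)$ and $\varphi^q_{k,i}(x, y)$ are decreasing with respect to each of their two variables.

This lemma is a direct consequence of the concavity of the function $g$. We are now
in a position to give a sketch of the proof of Theorem 1.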

Proof of Theorem 1 We consider the case where $I^p_{k_0}(i_0) < 0$. The case
$I^q_{k_0}(i_0') < 0$ can be treated in the same way.

The proof consists in proving that Assumptions (1-3) imply that neither demand
nor supply points located between $p_{i_0}$ and $q_{i_0+k_0}$ can be matched with points
located outside this interval, i.e. that the set
$S_{i_0} = \{p_i,\ i_0+1 \leq i \leq i_0+k_0\} \cup \{q_i,\ i_0 \leq i \leq i_0+k_0-1\}$
is stable by an optimal transport plan. In this case, the result
follows from Assumptions (1-2).

Suppose that $S_{i_0}$ is not preserved by an optimal transport plan $\sigma^*$. Three cases
can occur:

a) There exists $i_1 \in \mathbb{N}$ such that $1 \leq i_1 \leq i_0$ and $i_0 \leq \sigma^*(i_1) \leq i_0+k_0-1$, and
there exists $i_1' \in \mathbb{N}$ such that $\sigma^*(i_1)+1 \leq i_1' \leq i_0+k_0$ and $i_0+k_0 \leq \sigma^*(i_1') \leq N$.

b) There exists $i_2 \in \mathbb{N}$, with $i_0+1 \leq i_2 \leq i_0+k_0$, such that $1 \leq \sigma^*(i_2) \leq i_0-1$.

c) There exists $i_2 \in \mathbb{N}$, with $i_0+k_0 < i_2 \leq N$, such that $i_0 \leq \sigma^*(i_2) \leq i_0+k_0-1$.

We first prove that Case a) cannot occur.

In Case a), one can assume without loss of generality that $\sigma^*(i_1)$ is the largest index
such that $1 \leq i_1 \leq i_0$, $i_0 \leq \sigma^*(i_1) \leq i_0+k_0-1$ and that $i_1'$ is the smallest index such
that $\sigma^*(i_1)+1 \leq i_1' \leq i_0+k_0$, $i_0+k_0 \leq \sigma^*(i_1') \leq N$. With such assumptions, the
(possibly empty) subset $\{p_i,\ \sigma^*(i_1)+1 \leq i \leq i_1'-1\} \cup \{q_i,\ \sigma^*(i_1)+1 \leq i \leq i_1'-1\}$
is stable by $\sigma^*$. Because of Assumptions (1-2), no nesting can occur in this subset,
and $\sigma^*(i) = i$ for $i = \sigma^*(i_1)+1, \dots, i_1'-1$.

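On the other hand, since $\sigma^*$ is supposed to be optimal, one has:

$$c(p_{i_1}, q_{\sigma^*(i_1)}) + c(p_{i_1'}, q_{\sigma^*(i_1')}) + \sum_{i=\sigma^*(i_1)+1}^{i_1'-1} c(p_i, q_i) \leq c(p_{i_1}, q_{\sigma^*(i_1')}) + \sum_{i=\sigma^*(i_1)}^{i_1'-1} c(p_{i+1}, q_i).$$

Thanks to Lemma 2, one deduces from this last inequality that:

$$c(p_{i_0}, q_{\sigma^*(i_1)}) + c(p_{i_1'}, q_{i_0+k_0}) + \sum_{i=\sigma^*(i_1)+1}^{i_1'-1} c(p_i, q_i) \leq c(p_{i_0}, q_{i_0+k_0}) + \sum_{i=\sigma^*(i_1)}^{i_1'-1} c(p_{i+1}, q_i),$$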
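and then:

$$c(p_{i_0}, q_{\sigma^*(i_1)}) + \sum_{i=i_0}^{\sigma^*(i_1)-1} c(p_{i+1}, q_i) + c(p_{i_1'}, q_{i_0+k_0}) + \sum_{i=i_1'}^{i_0+k_0-1} c(p_{i+1}, q_i) + \sum_{i=\sigma^*(i_1)+1}^{i_1'-1} c(p_i, q_i) \leq c(p_{i_0}, q_{i_0+k_0}) + \sum_{i=i_0}^{i_0+k_0-1} c(p_{i+1}, q_i). \tag{3}$$

According to Assumption (1), $I^p_{\sigma^*(i_1)-i_0}(i_0) \geq 0$ and $I^p_{i_0+k_0-i_1'}(i_1') \geq 0$, so that:

$$\sum_{i=i_0}^{\sigma^*(i_1)} c(p_i, q_i) \leq c(p_{i_0}, q_{\sigma^*(i_1)}) + \sum_{i=i_0}^{\sigma^*(i_1)-1} c(p_{i+1}, q_i),$$

$$\sum_{i=i_1'}^{i_0+k_0} c(p_i, q_i) \leq c(p_{i_1'}, q_{i_0+k_0}) + \sum_{i=i_1'}^{i_0+k_0-1} c(p_{i+1}, q_i).$$

Combining these last inequalities with (3) one finds that:

$$\sum_{i=i_0}^{i_0+k_0} c(p_i, q_i) \leq c(p_{i_0}, q_{i_0+k_0}) + \sum_{i=i_0}^{i_0+k_0-1} c(p_{i+1}, q_i),$$

which contradicts Assumption (3).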

Let us now prove that Cases b) and c) contradict the assumptions. Cases b) and
c) can be treated in the same way. Consider Case b). Without loss of generality,
one can assume that $i_2$ is the smallest index such that $i_0+1 \leq i_2 \leq i_0+k_0$ and
$\sigma^*(i_2) \leq i_0-1$. Because there are necessarily as many demands as supplies between
$q_{i_0}$ and $p_{i_2}$, there exists one and only one index $i_2'$ such that $i_0 \leq \sigma^*(i_2') \leq i_2-1$ and
$1 \leq i_2' \leq i_0$. Consequently, the (possibly empty) subsets
$\{p_i,\ i_0+1 \leq i \leq \sigma^*(i_2')\} \cup \{q_i,\ i_0 \leq i \leq \sigma^*(i_2')-1\}$ and
$\{p_i,\ \sigma^*(i_2')+1 \leq i \leq i_2-1\} \cup \{q_i,\ \sigma^*(i_2')+1 \leq i \leq i_2-1\}$
are stable by an optimal transport plan. Because of Assumptions (1-2), no nesting
can occur in these subsets, and $\sigma^*(i) = i-1$ for $i = i_0+1, \dots, \sigma^*(i_2')$ and $\sigma^*(i) = i$
for $i = \sigma^*(i_2')+1, \dots, i_2-1$.

On the other hand, since $\sigma^*$ is supposed to be optimal, one has

$$c(p_{i_2}, q_{\sigma^*(i_2)}) + c(p_{i_2'}, q_{\sigma^*(i_2')}) + \sum_{i=i_0+1}^{\sigma^*(i_2')} c(p_i, q_{i-1}) + \sum_{i=\sigma^*(i_2')+1}^{i_2-1} c(p_i, q_i) \leq c(p_{i_2'}, q_{\sigma^*(i_2)}) + \sum_{i=i_0+1}^{i_2} c(p_i, q_{i-1}).$$

Thanks to Lemma 2, one deduces from this last inequality that:

$$c(p_{i_2}, q_{\sigma^*(i_2)}) + c(p_{i_0}, q_{\sigma^*(i_2')}) + \sum_{i=i_0+1}^{\sigma^*(i_2')} c(p_i, q_{i-1}) + \sum_{i=\sigma^*(i_2')+1}^{i_2-1} c(p_i, q_i) \leq c(p_{i_0}, q_{\sigma^*(i_2)}) + \sum_{i=i_0+1}^{i_2} c(p_i, q_{i-1}). \tag{4}$$

Because the cost is supposed to be increasing with respect to the distance, one
finds that $c(p_{i_0}, q_{\sigma^*(i_2)}) \leq c(p_{i_2}, q_{\sigma^*(i_2)})$, so that (4) implies:

$$c(p_{i_0}, q_{\sigma^*(i_2')}) + \sum_{i=i_0+1}^{\sigma^*(i_2')} c(p_i, q_{i-1}) + \sum_{i=\sigma^*(i_2')+1}^{i_2-1} c(p_i, q_i) \leq \sum_{i=i_0+1}^{i_2} c(p_i, q_{i-1}),$$

and then:

$$c(p_{i_0}, q_{\sigma^*(i_2')}) + \sum_{i=i_0+1}^{\sigma^*(i_2')} c(p_i, q_{i-1}) + \sum_{i=\sigma^*(i_2')+1}^{i_2-1} c(p_i, q_i) + \sum_{i=i_2+1}^{i_0+k_0} c(p_i, q_{i-1}) \leq \sum_{i=i_0+1}^{i_0+k_0} c(p_i, q_{i-1}). \tag{5}$$

According to Assumption (1), $I^p_{\sigma^*(i_2')-i_0}(i_0) \geq 0$, so that:

$$\sum_{i=i_0}^{\sigma^*(i_2')} c(p_i, q_i) \leq c(p_{i_0}, q_{\sigma^*(i_2')}) + \sum_{i=i_0}^{\sigma^*(i_2')-1} c(p_{i+1}, q_i).$$

Combining these last inequalities with (5) one finds that:

$$\sum_{i=i_0}^{i_0+k_0} c(p_i, q_i) \leq c(p_{i_0}, q_{i_0+k_0}) + \sum_{i=i_0}^{i_0+k_0-1} c(p_{i+1}, q_i),$$

which contradicts Assumption (3).

We have then shown that neither demand nor supply points located between $p_{i_0}$
and $q_{i_0+k_0}$ can be matched with points located outside this interval. The set $S_{i_0}$ is
then stable by an optimal transport plan. According to Assumptions (1-2), no nesting
can occur in $S_{i_0}$. The result follows. □



V. AN ALGORITHM FOR BALANCED CHAINS 

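The recursive use of our indicators is the basis of the following algorithm.

Algorithm: Set $\mathcal{V} = \{p_1, \dots, p_N, q_1, \dots, q_N\}$, $\ell^p = (1, \dots, N)$, $\ell^q = (1, \dots, N)$, and
$k = 1$.

While $\mathcal{V} \neq \emptyset$ and $k \leq N-1$ do

1. Compute $I_k^p(i)$ and $I_k^q(i')$ for $i = 1, \dots, N-k$ and $i' = 1, \dots, N-k-1$.

2. Define

$\mathcal{I}_k^p = \{i_0,\ 1 \leq i_0 \leq N-k,\ I_k^p(i_0) < 0\}$,
$\mathcal{I}_k^q = \{i_0',\ 1 \leq i_0' \leq N-k-1,\ I_k^q(i_0') < 0\}$,

and do

(a) If $\mathcal{I}_k^p = \emptyset$ and $\mathcal{I}_k^q = \emptyset$, set $k = k+1$.

(b) Else do

• for all $i_0$ in $\mathcal{I}_k^p$ and for $i = i_0+1, \dots, i_0+k$, do

— define $\sigma^*(\ell^p_i) = \ell^q_{i-1}$,

— remove $\{p_i, q_{i-1}\}$ from $\mathcal{V}$,

— remove $\ell^p_i$ and $\ell^q_{i-1}$ from $\ell^p$ and $\ell^q$ respectively.

• for all $i_0'$ in $\mathcal{I}_k^q$ and for $i = i_0'+1, \dots, i_0'+k$, do

— define $\sigma^*(\ell^p_i) = \ell^q_i$,

— remove $\{p_i, q_i\}$ from $\mathcal{V}$,

— remove $\ell^p_i$ and $\ell^q_i$ from $\ell^p$ and $\ell^q$ respectively.

• set $N = \frac{1}{2}\mathrm{Card}(\mathcal{V})$, and rename the points in $\mathcal{V}$ such that
$\mathcal{V} = \{p_1, \dots, p_N, q_1, \dots, q_N\}$,

$p_1 < q_1 < \dots < p_i < q_i < p_{i+1} < q_{i+1} < \dots < p_N < q_N$.

• set $k = 1$.

If $k = N-1$, for $i = 1, \dots, N$ set $\sigma^*(\ell^p_i) = \ell^q_i$.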
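
The following Python sketch illustrates the loop above on a single balanced chain. It is not the authors' implementation and makes several simplifying assumptions: indices are 0-based, only the first negative indicator found at the current order $k$ is applied before restarting from $k = 1$ (a sequential variant of step (b)), the points left over once no indicator of any order is negative are matched in order, and the sums are recomputed at each call rather than reused. All function names are hypothetical.

```python
from itertools import permutations
from math import sqrt
import random

def c(a, b, g):
    """Cost c(a, b) = g(|a - b|)."""
    return g(abs(a - b))

def indicator_p(p, q, k, i, g):
    """I_k^p(i) for 0-based i in range(N - k)."""
    nested = c(p[i], q[i + k], g) + sum(
        c(p[i + l + 1], q[i + l], g) for l in range(k))
    return nested - sum(c(p[i + l], q[i + l], g) for l in range(k + 1))

def indicator_q(p, q, k, i, g):
    """I_k^q(i) for 0-based i in range(N - k - 1)."""
    nested = c(p[i + k + 1], q[i], g) + sum(
        c(p[i + l], q[i + l], g) for l in range(1, k + 1))
    return nested - sum(c(p[i + l + 1], q[i + l], g) for l in range(k + 1))

def match_chain(p, q, g):
    """Match one chain p[0] < q[0] < p[1] < q[1] < ...

    Returns sigma, with demand i matched to supply sigma[i] (0-based).
    """
    lp = list(range(len(p)))      # labels of the demands still unmatched
    lq = list(range(len(q)))      # labels of the supplies still unmatched
    sigma = [None] * len(p)
    k = 1
    while lp and k <= len(lp) - 1:
        n = len(lp)
        P = [p[j] for j in lp]
        Q = [q[j] for j in lq]
        # First negative indicator of order k, if any.
        hit = next((("p", i) for i in range(n - k)
                    if indicator_p(P, Q, k, i, g) < 0), None)
        if hit is None:
            hit = next((("q", i) for i in range(n - k - 1)
                        if indicator_q(P, Q, k, i, g) < 0), None)
        if hit is None:
            k += 1                # step (a): nothing detected at this order
            continue
        kind, i0 = hit            # step (b): match the detected neighbors
        shift = 1 if kind == "p" else 0
        demands = set(range(i0 + 1, i0 + k + 1))
        supplies = {i - shift for i in demands}
        for i in demands:
            sigma[lp[i]] = lq[i - shift]
        lp = [lab for idx, lab in enumerate(lp) if idx not in demands]
        lq = [lab for idx, lab in enumerate(lq) if idx not in supplies]
        k = 1
    for a, b in zip(lp, lq):      # remaining points are matched in order
        sigma[a] = b
    return sigma

def plan_cost(p, q, sigma, g):
    return sum(c(p[i], q[sigma[i]], g) for i in range(len(p)))

# Usage on a random chain of 5 pairs, compared with exhaustive search.
rng = random.Random(0)
xs = sorted(rng.random() for _ in range(10))
p, q = xs[0::2], xs[1::2]
sigma = match_chain(p, q, sqrt)
best = min(plan_cost(p, q, s, sqrt) for s in permutations(range(5)))
```

On small random chains the cost of the plan returned by `match_chain` can be compared with the exhaustive minimum `best`, which gives a simple sanity check of the indicator-based detection.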

To test the efficiency of our algorithm, we have applied it to an increasing
number $N$ of pairs of points. For a fixed value of $N$, 100 samples of points have
been chosen randomly in $[0,1]$, and the mean of the number of evaluations of $g$
has been computed. The results are shown in Fig. 3.
The best case consists in finding a negative indicator at each step, and the worst
corresponds to the case where all the indicators are positive. These two cases
require respectively $N-1$ and $(N-1)^2$ evaluations of $g$.



FIG. 3: Number of in-line evaluations with respect to the number of pairs, for various
cost functions. The number a is the slope of the log-log graphs.



[1] A. Aggarwal, A. Bar-Noy, S. Khuller, D. Kravets, and B. Schieber. Efficient minimum 
cost matching using quadrangle inequality. In Foundations of Computer Science, 
1992. Proceedings of 33rd Annual Symposium, pages 583-592, 1992. 

[2] R. McCann. Exact solutions to the transportation problem on the line.
Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences,
455(1984):1341-1380, 1999.

[3] J. Delon, J. Salomon, and A. Sobolevskii. Local matching indicators for concave 
transport costs, in preparation. 

[4] J. Delon, J. Salomon, and A. Sobolevskii. Fast transport optimization for Monge 
costs on the circle (2009), preprint hal-00362834, submitted. 






